The SuperARV Language Model: Investigating the Effectiveness of Tightly Integrating Multiple Knowledge Sources
Authors
Abstract
This paper presents a new almost-parsing language model, based on the concept of Constraint Dependency Grammars, that incorporates multiple knowledge sources. Lexical features and syntactic constraints are tightly integrated into a uniform linguistic structure called a SuperARV that is associated with a word in the lexicon. The SuperARV language model reduces perplexity and word error rate compared to trigram, part-of-speech-based, and parser-based language models. The relative contributions of the various knowledge sources to the strength of our model are also investigated by using constraint relaxation at the level of the knowledge sources. We have found that although each knowledge source contributes to language model quality, lexical features are an outstanding contributor when they are tightly integrated with word identity and syntactic constraints. Our investigation also suggests possible reasons for the reported poor performance of several probabilistic dependency grammar models in the literature.
Similar resources
The robustness of an almost-parsing language model given errorful training data
An almost-parsing language model has been developed [1] that provides a framework for tightly integrating multiple knowledge sources. Lexical features and syntactic constraints are integrated into a uniform linguistic structure (called a SuperARV) that is associated with words in the lexicon. The SuperARV language model has been found able to reduce perplexity and word error rate (WER) compared...
Using Multiple-Variable Matching to Identify EFL Ecological Sources of Differential Item Functioning
Context is a vague notion with numerous building blocks making language test scores inferences quite convoluted. This study has made use of a model of item responding that has striven to theorize the contextual infrastructure of differential item functioning (DIF) research and help specify the sources of DIF. Two steps were taken in this research: first, to identify DIF by gender grouping via l...
Investigating the Position of Reason in the Method of Mystical and Philosophical Knowledge from the Perspective of Rumi and Ibn Sina
Reason is one of the sources of knowledge in human existence and human forces that distinguishes between good and evil or right and wrong. This has led many scientists and mystics to reflect on this issue among whom are Ibn Sina and Rumi. The purpose of this study is to study the nature of reason, its relationship with love and religion, factors and obstacles to reaching the perfection of ...
On Teaching to Diversity: Investigating the Effectiveness of MI-Inspired Instruction in an EFL Context
This study reports an experiment conducted to investigate the effectiveness of implementing MI-inspired instruction in an EFL context. To this end, a group of ten intermediate female students took part in a quasi-experimental study. At the beginning of the experiment, Multiple Intelligences Survey (Armstrong, 1993) was administered to determine the participants’ MI profiles. The participants we...
Investigating the relationship among complexity, range, and strength of grammatical knowledge of EFL students
Assessment of grammatical knowledge is a rather neglected area of research in the field with many open questions (Purpura, 2004). The present research incorporates recent proposals about the nature of grammatical development to create a framework consisting of dimensions of complexity, range and strength, and studies which dimension(s) can best predict the stat...
Publication date: 2002